Multi-lingual duration modeling

Authors

  • Jan P. H. van Santen
  • Chilin Shih
  • Bernd Möbius
  • Evelyne Tzoukermann
  • Michael Tanenblatt
Abstract

Controlling timing in text-to-speech synthesis systems is complicated because many contextual factors affect timing; moreover, which factors matter and what their precise effects are vary among languages. We describe here a language-independent approach for duration control. At run time, a language-independent timing module accesses language-specific tables. These tables specify which sub-classes of the feature space (i.e., all combinations of context and phone identity) are homogeneous in the specific sense that the same factors have similar effects on the cases in a sub-class. Within a sub-class, durations are modeled by simple arithmetic models such as multiplicative, additive, or, more generally, sums-of-products models. Exploratory statistical methods (supervised) and parameter estimation techniques (unsupervised) are used for
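
As a rough illustration of the table-driven design the abstract describes, the sketch below shows a language-independent function evaluating a sums-of-products model from a per-sub-class table. It is not the authors' implementation: the table layout, the name `predict_duration`, the factor names, and every number (read here as milliseconds or unitless scale factors) are assumptions made for this example. A purely multiplicative model appears as a single product term, and a purely additive model as several single-factor terms.

```python
from math import prod

# Hypothetical language-specific table (all names and values are invented).
# Each sub-class maps to a list of product terms; each term maps a factor
# name to a lookup of parameters indexed by that factor's level.
TABLES = {
    "vowel": [
        # One product term: a purely multiplicative model.
        {
            "phone": {"a": 120.0, "i": 90.0},
            "stress": {"stressed": 1.3, "unstressed": 1.0},
            "position": {"final": 1.4, "medial": 1.0},
        },
    ],
    "consonant": [
        # Several single-factor terms: a purely additive model.
        {"phone": {"s": 100.0, "t": 60.0}},
        {"stress": {"stressed": 10.0, "unstressed": 0.0}},
    ],
}


def predict_duration(subclass, features, tables=TABLES):
    """Evaluate a sums-of-products model for one sub-class.

    The result is the sum, over product terms, of the product of the
    parameters selected by the feature levels in `features`.
    """
    terms = tables[subclass]
    return sum(
        prod(params[features[factor]] for factor, params in term.items())
        for term in terms
    )


vowel = predict_duration(
    "vowel", {"phone": "a", "stress": "stressed", "position": "final"}
)
consonant = predict_duration("consonant", {"phone": "t", "stress": "stressed"})
```

Under this layout, supporting another language would mean supplying a different `TABLES` object while `predict_duration` stays unchanged, which mirrors the split between a language-independent module and language-specific tables.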

Similar resources

Lingual orthodontic treatment duration: performance of two different completely customized multi-bracket appliances (Incognito and WIN) in groups with different treatment complexities

INTRODUCTION The occurrence of side-effects of fixed orthodontic therapy, such as white-spot lesions and root resorption, are known to be significantly more frequent with increasing duration of treatment. Multi-bracket treatment should be as short as possible, in order to minimize the risks of collateral damage to teeth. The aim of this non-randomized clinical trial was to compare treatment dur...

N-Gram Language Modeling for Robust Multi-Lingual Document Classification

Statistical n-gram language modeling is used in many domains like speech recognition, language identification, machine translation, character recognition and topic classification. Most language modeling approaches work on n-grams of terms. This paper reports about ongoing research in the MEMPHIS project which employs models based on character-level n-grams instead of term n-grams. The models ar...

Multi-lingual and Multi-modal Speech Processing and Applications

Over the last decade voice technologies for telephony and embedded solutions became much more mature, resulting in applications providing mobile access to digital information from anywhere. Both a growing demand for voice driven applications in many languages and the need for improved usability and user experience now drives the exploration of multi-lingual speech processing techniques for reco...

Experiments in Cross Language Query Focused Multi-Document Summarization

The twin challenges of massive information overload via the web and ubiquitous computers present us with an unavoidable task: developing techniques to handle multilingual information robustly and efficiently, with as high quality performance as possible. Previous research activities on multilingual information access systems have studied cross-language information retrieval (CLIR), information ...

FST-based recognition techniques for multi-lingual and multi-domain spontaneous speech

In this paper we present techniques for building multi-domain and multi-lingual recognizers within a finite-state transducer (FST) framework. The flexibility of the FST approach is also demonstrated on the task of incorporating networks modeling different types of non-speech events into an existing word lattice network. The ability to create robust multi-domain and/or multi-lingual recognizers ...

Onset of action and duration of efficacy of inferior alveolar nerve block versus single lingual subperiosteal injection of 4% articaine in mandibular second molars: A randomized clinical trial

Background and Aim: Achieving adequate pulpal anesthesia could be challenging in mandibular molars. There are some disagreements about the success rate of local infiltration anesthesia with articaine as primary injection. Therefore, the aim of this study was to assess the efficacy of 4% articaine lingual subperiosteal injection as the primary injection for permanent mandibular second molars in ...

Publication date: 1997